Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference